Picture for Thomas Dooms

Thomas Dooms

When Are Two Networks the Same? Tensor Similarity for Mechanistic Interpretability

Add code
May 14, 2026
Viaarxiv icon

Parameterized Synthetic Text Generation with SimpleStories

Add code
Apr 12, 2025
Viaarxiv icon

Compositionality Unlocks Deep Interpretable Models

Add code
Apr 03, 2025
Viaarxiv icon

Tokenized SAEs: Disentangling SAE Reconstructions

Add code
Feb 24, 2025
Figure 1 for Tokenized SAEs: Disentangling SAE Reconstructions
Figure 2 for Tokenized SAEs: Disentangling SAE Reconstructions
Figure 3 for Tokenized SAEs: Disentangling SAE Reconstructions
Figure 4 for Tokenized SAEs: Disentangling SAE Reconstructions
Viaarxiv icon

Bilinear MLPs enable weight-based mechanistic interpretability

Add code
Oct 10, 2024
Viaarxiv icon

Weight-based Decomposition: A Case for Bilinear MLPs

Add code
Jun 06, 2024
Viaarxiv icon

The Trifecta: Three simple techniques for training deeper Forward-Forward networks

Add code
Nov 29, 2023
Viaarxiv icon